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QAT as a Platform Component 
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Three Separate Subsystems 


° Server class Platform Control Hub (PCH) via Direct Media 
Interface (DMI) Gen 3 x4 


* JIO (4x10G with iWARP RDMA) 
* Intel® QAT 

- Crypto and Compression Hardware Acceleration 

- >100 Gbs Bulk Crypto 

- 100 kops RSA 2 k Decrypt 

- 68 Gbs Verified Compression 

- 160 Gbs Decompression 

- Supports SR-IOV with 3 PF, 48 Trusted VFs 

(Acceleration and I/O via PCle* x16 and/or x8 Gen 3) 

Supports multiple configurations 


° Down on Motherboard as PCH, Ethernet and/or Intel® QAT 
Acceleration 


and/or 
° PCle* add-in card for Ethernet and/or QAT Acceleration 


NOTE: Separate power planes and Connectivity for power optimized 


configurations 
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Software Stack 


* Intel® QAT zip is a high performance user space librar 
designed for environments requiring both highly threaded 
processes and/or many processes 


° Intel® QAT user space library provides access to both the 
Intel® QAT out-of-tree driver and directly to the Intel® QAT 
device via memory mapped address space 


° Out-of-tree driver is a superset of the in-tree drive 
User space applications via Intel® QAT API, via “direct model”, for instance, 
queues are mapped to user space (for performance) 


° User-Space DMA-able Memory Library Driver also supplied 


e QAT Engine is a .so that plugs into the engine framework of 
openSSL, providing crypto services from Intel® QAT 
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Nginx Configuration File 


load module modules/ngx http gatzip filter module.so; 
gatzip on; 
gzip proxied any; 
gatzip min length 128; 
gatzip comp level 1; 
qatzip buffers 16 16k; 


gatzip types text/css text/javascript text/xml text/plain text/x-component 
application/javascript application/json application/xml application/rss+xml font/ 
truetype font/opentype application/vnd.ms-fontobject image/svg+xml application/octet- 
stream image/jpeg; 


gatzip chunk size 64k; 
qatzip stream size 2092032; 
gatzip sw threshold 2k 


Nginx Configuration File (Cont.) 


load module modules/ngx ssl engine qat module.so; 


ssl enginet 
use engine qat; 
qefault algorithms ALL; 
qat engine( 
qat offload mode async; 
qat notify mode poll; 


qat poll mode heuristic; 


Performance Test Setup 


A -9 


6138 
tietarako 
Intel® 


4 x copies of ./ab running 
simultaneously 
Intel® QuickAssist Adapter 
x16 PCllanes, 


UEA 2400 MHz 16 GB 


(16 cores used for Nginx 
tasks) 


Performance Test Input 


Four files: 
- front pages from English language news sites 


* ABC News* First 5,441 bytes 
* Cable Network News” (CNN) 137,231 bytes 
° Times of India* 372,028 bytes 
* Shanghai Daily* 64,137 bytes 


Tests ran concurrent with four copies of Apache Benchmark (ab). 
Example: 


ab =n 100000 =c 100 =Z AES128-SHA -f TLS1.2 =H 'Accept Encoding:gzip,deflate' https:// 
172.16.1.1:4400/<name of file> 


10 


Performance Test ARESULES uro 
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1 x Intel® Xeon® Gold 6138P Processor @ 2.0 GHz, Intel® QuickAssist Adapter 8970, x16 PCI lanes, DDR4 2400 MHz 16 GB 
16 cores used for Nginx* tasks 


1. Results have been estimated based on internal Intel analysis and are provided for informational purposes only. Any difference in system hardware or software design or configuration may affect actual performance. 
2. All products, platforms, dates, features, and figures specified are preliminary based on current expectations and are subject to change without notice. 
3. Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. 


Performance tests, such as SYSmark* and MobileMark*, are measured using specific computer systems, components, software, operations and functions. Any change to any of those factors may cause the results to vary. You 
should consult other information and performance tests to assist you in fully evaluating your contemplated purchases, including the performance of that product when combined with other products. For more complete 
information visit www.intel.com/benchmarks. 


Configurations: See slides 9 and 10 for configurations. Testing performed by Intel. 
4. Performance results are based on testing as of September 25, 2018 and may not reflect all publicly available security updates. See configuration disclosure for details. No component or product can be absolutely secure. 
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CPU Usage (40 Gbps LAN) 
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Intel@ QAT 
1 x Intel@ Xeon@ Gold 6138P Processor a. 2.0 GHz, Intel® QuickAssist Adapter 8970, x16 PCI lanes, DDR4 2400 MHz 16 GB 


H * 

16 cores used for Nginx* tasks 

1. Results have been estimated based on internal Intel analysis and are provided for informational purposes only. Any difference in system hardware or software design or configuration may affect actual performance. 

2. All products, platforms, dates, features, and figures specified are preliminary based on current expectations and are subject to change without notice. 

3. Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. 
Performance tests, such as SYSmark” and MobileMark*, are measured using specific computer systems, components, software, operations and functions. Any change to any of those factors may cause the results to vary. You 
should consult other information and performance tests to assist you in fully evaluating your contemplated purchases, including the performance of that product when combined with other products. For more complete 
information visit www.intel.com/benchmarks. 


Configurations: See slides 9 and 10 for configurations. Testing performed by Intel. 
Performance results are based on testing as of September 25, 2018 and may not reflect all publicly available security updates. See configuration disclosure for details. No component or product can be absolutely secure. 
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Core Scaling Results 


NGINX Secure TLS+GZip Webserver: Book1 Calgary Corpus File 760K 
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1C2T 2CAT 4C8T 6C12T 8C16T | 10C20T | 12C24T | 14C28T | 16C32T | 18C36T | 19C38T 
m Core (s) 2,149.78 | 1,067.46 537.39 | 360.50 | 271.60 | 21686 181.15 15630 | 13639 | 12149 | 115.51 
m QAT (s); 276.34 138.81 69.01 46.28 34.77 27.82 23.56 21.42 20.14 20.01 19.76 


Completion Time (s) Lower is Better 


1 x Intel® Xeon® Gold 6138P Processor @ 2.0 GHz, Intel® QuickAssist Adapter 8970, x16 PCI lanes, DDR4 2400 MHz 16 GB 
16 cores used for Nginx* tasks 


1. Results have been estimated based on internal Intel analysis and are provided for informational purposes only. Any difference in system hardware or software design or configuration may affect actual performance. 
2. All products, platforms, dates, features, and figures specified are preliminary based on current expectations and are subject to change without notice. 
3. Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. 


Performance tests, such as SYSmark” and MobileMark*, are measured using specific computer systems, components, software, operations and functions. Any change to any of those factors may cause the results to vary. You 
should consult other information and performance tests to assist you in fully evaluating your contemplated purchases, including the performance of that product when combined with other products. For more complete 
information visit www.intel.com/benchmarks. 


Configurations: See slides 9 and 10 for configurations. Testing performed by Intel. 
4. Performance results are based on testing as of September 25, 2018 and may not reflect all publicly available security updates. See configuration disclosure for details. No component or product can be absolutely secure. 
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2 x Intel® Xeon® Gold 6152 Processor HO @ 2.1 GHz, Intel C627 Chipset B1, x16+x8 PCI lanes, DDR4 2400 MHz 16 GB 


22 cores used for Nginx* tasks 

1. Results have been estimated based on internal Intel analysis and are provided for informational purposes only. Any difference in system hardware or software design or configuration may affect actual performance. 
2. All products, platforms, dates, features, and figures specified are preliminary based on current expectations and are subject to change without notice. 

3. Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. 


Performance tests, such as SYSmark* and MobileMark*, are measured using specific computer systems, components, software, operations and functions. Any change to any of those factors may cause the results to vary. You should 
consult other information and performance tests to assist you in fully evaluating your contemplated purchases, including the performance of that product when combined with other products. For more complete information visit 
www.intel.com/benchmarks. 


Configurations: See slide 9 and 10 for configuration. Testing performed by Intel. 
4. Performance results are based on testing as of September 1, 2017 and may not reflect all publicly available security updates. See configuration disclosure for details. No component or product can be absolutely secure. 
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